Deep Reinforcement Learning for Crowdshipping Last-Mile Delivery with Endogenous Uncertainty

نویسندگان

چکیده

In this work, we study a flexible compensation scheme for last-mile delivery where company outsources part of the activity delivering products to its customers occasional drivers (ODs), under named crowdshipping. All deliveries are completed at minimum total cost incurred with their vehicles and plus paid ODs. The decides on best offer ODs planning stage. We model our problem based stochastic dynamic environment orders volunteering make present themselves randomly within fixed time windows. uncertainty is endogenous in sense that influences availability. develop deep reinforcement learning (DRL) algorithm can deal large instances while focusing quality solution: combine combinatorial structure action space neural network approximated value function, involving techniques from machine integer optimization. results show effectiveness DRL approach by examining out-of-sample performance it suitable process samples uncertain data, which induces better solutions.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Stochastic last-mile delivery problems with time constraints

When a package is shipped, the customer often requires the delivery to be made within a particular time window or by a deadline. However, meeting such time requirements is difficult, and delivery companies may not always know ahead of time which customers will need a delivery. In this thesis, we present models and solution approaches for two stochastic last-mile delivery problems in which custo...

متن کامل

Transshipment Networks for Last-Mile Delivery in Congested Urban Areas1

This paper introduces the concept of transshipment networks, a collection of strategically located transshipment platforms, for efficient and flexible last-mile delivery in congested urban areas. By implementing transshipment platforms, logistics operators can select the locations, light-freight vehicle types and operating schedules that best fit specific distribution strategies, and, simultane...

متن کامل

Minimizing waiting time in last mile delivery for smart city

Customers consider a reliable and on-time last mile delivery in Omni-channel business as important as price and good’s quality. We focus on minimizing the waiting time for customers and carriers in last mile delivery. We model the problem as an optimization problem and develop Genetic Algorithm to solve it.

متن کامل

Naive Reinforcement Learning with Endogenous

This article considers a simple model of reinforcement learning. All behavior change derives from the reinforcing or deterring effect of instantaneous payoff experiences. Payoff experiences are reinforcing or deterring depending on whether the payoff exceeds an aspiration level or falls short of it. Over time, the aspiration level is adjusted toward the actually experienced payoffs. This articl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Mathematics

سال: 2022

ISSN: ['2227-7390']

DOI: https://doi.org/10.3390/math10203902